Abstract: In bacteria, glycogen or oligosaccharide accumulation involves 
glucose-1-phosphate partitioning into either ADP-glucose (ADP-Glc) or UDP-Glc. Their 
respective synthesis is catalyzed by allosterically regulated ADP-Glc pyrophosphorylase 
(EC 2.7.7.27, ADP-Glc PPase) or unregulated UDP-Glc PPase (EC 2.7.7.9). In this work, 
we characterized the UDP-Glc PPase from Streptococcus mutans. In addition, we 
constructed a chimeric protein by cutting the C-terminal domain of the ADP-Glc PPase 
from Escherichia coli and pasting it to the entire S. mutans UDP-Glc PPase. Both proteins 
were fully active as UDP-Glc PPases and their kinetic parameters were measured. The 
chimeric enzyme had a slightly higher affinity for substrates than the native S. mutans 
UDP-Glc PPase, but the maximal activity was four times lower. Interestingly, the 
chimeric protein was sensitive to regulation by pyruvate, 3-phosphoglyceric acid and 
fructose-1,6-bisphosphate, which are known to be effectors of ADP-Glc PPases from 
different sources. The three compounds activated the chimeric enzyme up to three-fold 
and increased the affinity for substrates. This chimeric protein is the first reported 
UDP-Glc PPase with allosteric regulatory properties. In addition, this is a pioneering study 
of a chimeric enzyme constructed as a hybrid of two pyrophosphorylases with 
different specificity toward nucleoside-diphospho-glucose, and our results are 
relevant for a deeper understanding of the evolution of allosterism in this family of enzymes. 

Keywords: protein engineering; allosteric regulation; pyrophosphorylases evolution; 
UDP-glucose; ADP-glucose 



1. Introduction 

The fate of Glc-1P in sugar anabolism involves a first step where the Glc moiety is "activated" by 
the formation of a nucleoside-diphospho-glucose (NDP-Glc) catalyzed by different NDP-Glc 
pyrophosphorylases (NDP-Glc PPases). Later, diverse glycosyl transferases with specificity toward a 
particular NDP-Glc lead the monosaccharide to a variety of carbohydrate metabolic routes. In general, 
in bacteria, there are two major biochemical roles for nucleotide-linked sugars: as intermediates in the 
formation of monosaccharides used in the production of complex carbohydrates, via UDP-Glc or as 
glycosyl donors for glycogen synthesis, using ADP-Glc [1,2]. These two key metabolites are products 
of either UDP-Glc or ADP-Glc PPase, through a reaction that requires a divalent metal ion 
(physiologically Mg²⁺): U(A)TP + Glc-1P ⇌ U(A)DP-Glc + PPi. Other specific NDP-sugar PPases 
complement the metabolic scenario for the production of the multiple mono-, oligo-, and 
poly-saccharides in the cell, which are found as free components or covalently bound to proteins 
and lipids [3,4]. 

UDP-Glc PPase (EC 2.7.7.9) is ubiquitously distributed in all types of organisms, and it plays a 
critical role in carbohydrate metabolism [5]. Significant differences at the level of amino acid 
sequence and three-dimensional structure found between the enzymes from prokaryotes and 
eukaryotes imply that they are not homologous. Eukaryotic UDP-Glc PPases are bigger than those 
found in bacteria [5,6], and the enzyme from Entamoeba histolytica (and probably from all protozoa) 
was recently characterized as being regulated by redox modification of critical cysteinyl residues [7]. 
Many bacterial UDP-Glc PPases have been characterized [8-13], and the crystallographic structures of 
the enzyme from Escherichia coli [14], Sphingomonas elodea [15] and Corynebacterium glutamicum [16] 
have been elucidated. The prokaryotic UDP-Glc PPase is a dimeric/tetrameric protein formed by a 
single subunit of ~35 kDa with a relatively high specific activity and specificity for Glc-1P 
and UTP [8-13]. 

A main characteristic among prokaryotic NDP-sugar PPases (including UDP-Glc PPase) is that 
they are non-regulated enzymes. However, ADP-Glc PPase (EC 2.7.7.27) is an exception in that it 
catalyzes the key regulatory step in the pathway for glycogen and starch biosynthesis in bacteria and 
plants, respectively. Most ADP-Glc PPases characterized so far are allosterically regulated by 
metabolites that are principal intermediates in the major carbon assimilation pathway in the respective 
organism [1,2]. The three-dimensional structure of the homotetrameric forms of the enzyme from 
potato tuber and Agrobacterium tumefaciens has been recently solved by X-ray crystallography [17,18]. 
Structural studies have determined that ADP-Glc PPases are larger than other prokaryotic NDP-sugar 
PPases. The former enzymes have an N-terminal catalytic domain (structurally similar to all PPases) 
that contains the active site, plus an additional C-terminal domain (absent in other PPases). On the 
basis of different studies [19-25], it has been proposed that the distinctive C-domain in ADP-Glc 
PPases is functionally related to allosteric regulation. Herein, we report the molecular cloning and 
heterologous expression of the gene coding for UDP-Glc PPase from Streptococcus mutans. Also, we 
constructed a chimeric protein by fusing to the latter S. mutans enzyme the C-terminal domain of the 
E. coli ADP-Glc PPase. The resulting hybrid protein retained UDP-Glc PPase activity and exhibited 
allosteric properties, being activated by 3-phosphoglycerate (3-PGA), fructose-1,6-bisphosphate 
(Fru-1,6-bisP) and pyruvate (Pyr). Our results have an impact on understanding the 
structure-to-function relationship between domains in PPases as well as the strategic changes driven by 
evolution to awaken allosterism in proteins. 

2. Results and Discussion 

2.1. Isolation and Analysis of the Gene Coding for UDP-Glc PPase in S. mutans and Construction of 
the Chimeric Protein 

The genome elucidated for S. mutans UA159 indicates the presence of a single gene coding for a 
putative UDP-Glc PPase (SMU_322c; Gene ID: 1029376) [26]; known as galU according to previous 
reports [6,9,10,27] or gtaB in other sources [8,28]. To determine the functional role of galU in 
S. mutans we amplified this single gene from S. mutans ATCC 25175 using specific primers properly 
designed (see details under Experimental Section) based on the database information available for 
S. mutans UA159 [26]. The amplified gene (the sequence of which was deposited in NCBI; GenBank 
accession number KC626324) codes for a protein 100% identical to the one in the genome of the 
reference strain. The S. mutans galU gene codes for a protein (SmuGalU) with a theoretical 
molecular mass of 33.9 kDa and a 33.8% and 40.4% identity with UDP-Glc PPases from 
C. glutamicum and Helicobacter pylori, respectively. The former was used to elucidate the active site 
geometry in this type of enzymes [16] and the H. pylori protein for determining the enzymatic reaction 
mechanism (which was bi-bi ordered) [29]. Also, the protein coded by S. mutans galU shares similar 
identity to the UDP-Glc PPases from E. coli (41.5%), Sphingomonas elodea (41.1%) and 
Streptomyces coelicolor (43.8%), which have been structurally and kinetically characterized [8,14,15]. 
The S. mutans GalU has a high identity (85.1%) with the UDP-Glc PPase from S. pneumoniae [9,27]. 

The gene amplified from S. mutans ATCC 25175 was utilized for two main purposes (Figure 1): 
(i) to insert it into the pRSET-B expression plasmid, looking to produce the recombinant UDP-Glc 
PPase (SmuGalU) as a tool for the structural and kinetic characterization; and (ii) to construct a gene 
coding for a chimeric protein, seeking to investigate the functionality of key domains in PPases. 
Amino acid sequence alignment between UDP-Glc and ADP-Glc PPases shows that these proteins 
share a homologous N-terminal domain, which is involved in catalysis. However, the enzymes specific 
for ADP-Glc are larger proteins with an extra C-terminal domain that is presumably related to 
allosteric properties as indicated above (Figure 1a, enlarged in Figure S1). To test this 
hypothesis, we produced a hybrid protein by fusing the putative regulatory C-domain (from the E. coli 
ADP-Glc PPase) to the C-terminus of the non-allosteric UDP-Glc PPase from S. mutans (Figure 1b). 

Figure 1. Schematic representation of the construction of chimeric SmuGalU-Δ294EcoGlgC. 
(a) Alignment between different ADP-glucose pyrophosphorylases (ADP-Glc PPases) and 
UDP-Glc PPases (Ref: Solanum: ADP-Glc PPase small subunit from potato; AtuGlgC, 
A. tumefaciens ADP-Glc PPase; EcoGlgC, E. coli ADP-Glc PPase; Chimera, chimeric 
SmuGalU-Δ294EcoGlgC; SmuGalU, S. mutans UDP-Glc PPase; HpyGalU, H. pylori 
UDP-Glc PPase; CglGalU, C. glutamicum UDP-Glc PPase). P295 of the E. coli ADP-Glc PPase is 
marked above the alignment; (b) Construction of the chimeric enzyme: The C-terminus belonging to the 
ADP-Glc PPase from E. coli (yellow) was "added" to the entire UDP-Glc PPase from 
S. mutans ATCC 25175 (light-blue); (c) Plasmids used to express both proteins. 

Figure 1 details the "cut and paste" strategy used to construct the chimeric gene coding for the 
hybrid protein SmuGalU-Δ294EcoGlgC. We pasted a DNA fragment cut from the E. coli glgC 
gene, which codes for the 137 C-terminal amino acid residues of the ADP-Glc PPase (starting at the 
codon for residue P295, which is in a connector loop), to the entire S. mutans galU gene. The 
resulting 1329 bp DNA piece was used to construct a pRSET-B derivative plasmid (Figure 1c) suitable 
to express a 443 amino acid chimeric protein with a theoretical molecular mass of 49.4 kDa. 
Chimeric enzymes were previously obtained and characterized after switching/swapping N- and 
C-terminal domains between ADP-Glc PPases (specifically involving the enzymes from E. coli and 
A. tumefaciens [19] or from prokaryotic and eukaryotic photosynthetic organisms [30]). This is the 
first report regarding a hybrid UDP-Glc PPase/ADP-Glc PPase protein. 
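
As a consistency check on the sizes quoted above, the codon arithmetic can be written out explicitly. The short Python sketch below is illustrative only: it assumes that the 1329 bp fragment is counted without a stop codon and that the 921 bp length reported for galU in the Experimental Section includes one. Neither assumption is stated in the text, and the variable names are hypothetical.

```python
# Codon arithmetic for the chimeric construct (illustrative sketch).
# Assumption: the 1329 bp coding fragment excludes the stop codon,
# while the 921 bp galU gene length includes it.
CODON_LENGTH = 3

chimera_coding_bp = 1329     # size of the fused DNA piece (from the text)
glgc_cterm_residues = 137    # C-terminal residues taken from E. coli GlgC
galu_gene_bp = 921           # galU gene length (Experimental Section)

# Residues encoded by the fused fragment.
chimera_residues = chimera_coding_bp // CODON_LENGTH
# Residues contributed by the S. mutans UDP-Glc PPase.
galu_residues = chimera_residues - glgc_cterm_residues
# Length of galU if one stop codon is added back.
galu_bp_with_stop = (galu_residues + 1) * CODON_LENGTH

print(f"chimeric protein: {chimera_residues} residues")
print(f"SmuGalU portion: {galu_residues} residues")
print(f"galU length incl. stop codon: {galu_bp_with_stop} bp")
```

Under these assumptions the fragment length, the 443-residue chimera and the 921 bp galU gene are mutually consistent.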

2.2. Structural and Kinetic Characterization of SmuGalU and Chimeric SmuGalU-Δ294EcoGlgC 

Both SmuGalU and chimeric SmuGalU-Δ294EcoGlgC were produced as recombinant proteins via 
heterologous expression in E. coli using the respective pRSET-B derivative plasmid (see details in 
Figure 1). Thus, both proteins were obtained having a His-tag fused at the N-terminus. When E. coli 
BL21 (DE3) was used as a host, native SmuGalU was over-expressed in soluble fractions but 
SmuGalU-Δ294EcoGlgC was mostly recovered in the pellet (inclusion bodies). The chimeric protein 
appeared in soluble fractions when the expression host was changed to E. coli BL21 (DE3) pLysS. 
Culture conditions and procedures were similar for both hosts (see Experimental Section). After 
inducing expression and obtaining crude extracts from the transformed cells, the recombinant 
His-tagged proteins were purified by immobilized metal (Ni 2+ ) affinity chromatography, after which 
they reached a high degree of purity according to sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE) analysis (Figure 2a). Size exclusion chromatography on Superdex 200 
revealed that under soluble conditions SmuGalU and SmuGalU-Δ294EcoGlgC formed 
homotetrameric quaternary structures with molecular masses of ~150 kDa and ~200 kDa, respectively 
(Figure 2b). Results obtained with SmuGalU are in good agreement with tetrameric structures 
previously determined for UDP-Glc PPases from different sources, e.g., both crystallized enzymes 
from E. coli and S. elodea [14,15]. On the other hand, the native form found for 
SmuGalU-Δ294EcoGlgC indicates that this chimeric protein shares similar oligomeric properties with the 
polypeptides that form its hybrid structure. 
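
The oligomeric state inferred from gel filtration follows from dividing the apparent native mass by the subunit mass. The following is a minimal sketch, assuming the theoretical subunit masses given above (33.9 and 49.4 kDa) and ignoring the extra mass contributed by the His-tag; the function name is hypothetical.

```python
def subunit_count(native_mass_kda: float, subunit_mass_kda: float) -> float:
    """Ratio of apparent native mass to theoretical subunit mass."""
    return native_mass_kda / subunit_mass_kda

# Approximate native masses from size exclusion chromatography (from the text).
smu_galu = subunit_count(150.0, 33.9)
chimera = subunit_count(200.0, 49.4)

for name, ratio in (("SmuGalU", smu_galu), ("SmuGalU-Δ294EcoGlgC", chimera)):
    print(f"{name}: {ratio:.1f} subunits, nearest integer {round(ratio)}")
```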

Figure 2. (a) Sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) of 
recombinant UDP-Glc PPase from S. mutans and chimeric SmuGalU-Δ294EcoGlgC after 
purification. Lane 1: Molecular weight markers; Lane 2: chimeric SmuGalU-Δ294EcoGlgC; 
Lane 3: His-tagged SmuGalU. Purifications were conducted as described in the Experimental 
Section; (b) Molecular mass determination, performed from size exclusion chromatography, 
as detailed in the Experimental Section. 

Purified SmuGalU and SmuGalU-Δ294EcoGlgC had UDP-Glc PPase activity. They catalyzed the 
synthesis of UDP-Glc and PPi from Glc-1P and UTP (in the presence of 3 mM Mg²⁺) with specific 
activities of 40 and 11 U/mg, respectively. It is well known that a divalent metal ion (commonly Mg²⁺) 
is an essential cofactor for ADP-Glc PPases [1,2] and UDP-Glc PPases [5,8,10,11,16]. In our hands, 
the activities of SmuGalU and SmuGalU-Δ294EcoGlgC were strictly dependent on Mg²⁺. Both proteins 
were fully active at ~3 mM and inhibited by higher concentrations of the divalent cation (Figure 3). 
Other metal ions could replace Mg²⁺, as illustrated by Figure 4. At 0.5 mM Mn²⁺ SmuGalU reached a 
four-fold higher activity than with 3 mM Mg²⁺, but the enzyme was inhibited at higher Mn²⁺ levels. 
This enzyme was also active with Cd²⁺, Ca²⁺, Co²⁺, Ni²⁺, Cu²⁺ and Cr²⁺, with higher activity at 0.5 mM 
of the metal ion and inhibition at different levels with higher amounts of the respective divalent 
cofactor (Figure 4a). Concerning SmuGalU-Δ294EcoGlgC, the protein was as active as with 3 mM 
Mg²⁺ when assayed with 0.5 mM of Mn²⁺, Ca²⁺, Co²⁺, or Cu²⁺, whereas with 0.5 mM Cd²⁺, Ni²⁺, or 
Cr²⁺ it showed only about half the activity. Except for Mn²⁺ and Ca²⁺, the other divalent cations 
inhibited the hybrid protein at higher concentrations (Figure 4b). 

Figure 3. Mg²⁺ curves for both SmuGalU (filled circles) and chimeric 
SmuGalU-Δ294EcoGlgC (empty circles). 

A similar pH-dependence of activity was observed for SmuGalU and SmuGalU-Δ294EcoGlgC 
when studied in the range of pH 6.0-10.0. Both proteins exhibited a maximum at pH 8.0, and activity 
decreased sharply below pH 7.0 (data not shown). These results are in good agreement with those previously 
reported for pneumococcal UDP-Glc PPase, which is fully active at pH 8.0-8.5 [10], as well as for 
ADP-Glc PPases from different sources, where optimal activity is around pH 8.0 [1,2]. After the 
behavior of the proteins with respect to pH and the requirement for divalent metal ions were studied, 
further kinetic characterization was performed at pH 8.0 and 3 mM Mg²⁺, a saturating 
concentration. The kinetic parameters of SmuGalU and SmuGalU-Δ294EcoGlgC for the substrates 
Glc-1P and UTP and the divalent cofactor Mg²⁺ are summarized in Table 1. The analysis of these parameters 
is needed to compare the kinetic properties of these recombinant proteins and understand the 
functionality of the different structural domains in PPases. 

Figure 4. Use of different divalent metal ions by SmuGalU (a) and chimeric 
SmuGalU-Δ294EcoGlgC (b). Empty bars correspond to 0.5 mM, sparse filled bars to 
2.5 mM and dense filled bars to 5 mM of the corresponding metal analyzed. Controls 
correspond to the enzyme activity measured under the same conditions, using 3 mM Mg²⁺. 

Table 1. Kinetic parameters for SmuGalU and SmuGalU-Δ294EcoGlgC. Parameters were 
calculated from averaged data from three independent experiments, as detailed in the 
Experimental Section. 

Substrate   Kinetic parameter            SmuGalU          SmuGalU-Δ294EcoGlgC
            Vmax (U/mg)                  62.1 ± 2.2       15.4 ± 0.5
UTP         S0.5 (mM)                    0.68 ± 0.06      0.24 ± 0.02
            nH                           1.5 ± 0.2        1.2 ± 0.1
            Vmax/S0.5 (U mg⁻¹ mM⁻¹)      91.3             64.2
Glc-1P      S0.5 (mM)                    0.090 ± 0.005    0.060 ± 0.005
            nH                           1.3 ± 0.1        1.4 ± 0.1
            Vmax/S0.5 (U mg⁻¹ mM⁻¹)      690              257
Mg²⁺        S0.5 (mM)                    0.81 ± 0.06      0.62 ± 0.05
            nH                           2.4 ± 0.6        2.7 ± 0.6
            Vmax/S0.5 (U mg⁻¹ mM⁻¹)      76.7             24.8

SmuGalU deviated slightly from hyperbolic behavior for both UTP and Glc-1P, displaying positive 
cooperativity. Saturation curves for Mg²⁺ were even more sigmoidal (Table 1). Apparent affinities for 
the different substrates/cofactor exhibited by SmuGalU were in the same order of magnitude as those 
reported for other bacterial UDP-Glc PPases so far characterized [8,10,11,31,32]. The recombinant 
enzyme reached a Vmax of 62 U/mg (Table 1), which was significantly higher than that of the UDP-Glc PPases 
from E. coli [32], S. elodea [31] and S. pneumoniae [10], although not as high as that of the UDP-Glc PPase from 
S. coelicolor [8], and in the same order as the enzyme from Xanthomonas spp. [11]. Concerning 
SmuGalU-Δ294EcoGlgC, saturation curves also showed a slight or marked deviation from 
hyperbolic behavior for the substrates or the divalent cation cofactor, respectively (Table 1). The 
chimeric protein showed a three-fold higher apparent affinity for UTP than the UDP-Glc PPase 
from S. mutans, whilst affinities for Glc-1P and Mg²⁺ remained at the same level. In addition, the 
chimeric protein exhibited a four-fold lower Vmax when compared with SmuGalU (Table 1). Thus, the 
results suggest that fusing the C-terminal domain of the E. coli ADP-Glc PPase to the 
S. mutans UDP-Glc PPase leads the enzyme to adopt a conformation with a slightly reduced 
Vmax/S0.5 ratio (analogous to the ratio Vmax/Km, defined as catalytic efficiency for hyperbolic kinetics) (Table 1). 
The kinetic properties exhibited by SmuGalU-Δ294EcoGlgC are remarkable for a hybrid protein 
composed of domains of PPases with different specificity. Although our studies are the first dealing 
with the kinetic characterization of a UDP-Glc PPase/ADP-Glc PPase chimeric enzyme, they can be 
compared with previous hybrids between ADP-Glc PPases. Thus, when N- and C-terminal domains 
from the E. coli and A. tumefaciens ADP-Glc PPases were switched, the chimeric construct exhibited 
activities between 20% and 30% of the value of the original wild-type enzymes [19]. 
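
The parameters in Table 1 can be read through the Hill equation, v = Vmax·S^nH/(S0.5^nH + S^nH), where nH > 1 indicates positive cooperativity. The Python sketch below is only illustrative: it assumes that the saturation curves were fitted to this equation (the fitting procedure is not described in this part of the text) and recomputes the Vmax/S0.5 ratios from the Table 1 values. Function and variable names are hypothetical.

```python
def hill_rate(s, vmax, s05, n_h):
    """Reaction rate from the Hill equation at substrate concentration s."""
    return vmax * s**n_h / (s05**n_h + s**n_h)

def efficiency(vmax, s05):
    """Vmax/S0.5, the analogue of Vmax/Km for non-hyperbolic kinetics."""
    return vmax / s05

# Vmax in U/mg and S0.5 in mM, taken from Table 1.
VMAX = {"SmuGalU": 62.1, "chimera": 15.4}
S05 = {
    "UTP": {"SmuGalU": 0.68, "chimera": 0.24},
    "Glc-1P": {"SmuGalU": 0.090, "chimera": 0.060},
    "Mg2+": {"SmuGalU": 0.81, "chimera": 0.62},
}

# Vmax/S0.5 for every substrate/enzyme pair.
ratios = {
    substrate: {enzyme: efficiency(VMAX[enzyme], s05) for enzyme, s05 in values.items()}
    for substrate, values in S05.items()
}

for substrate, values in ratios.items():
    print(substrate, {enzyme: round(r, 1) for enzyme, r in values.items()})
```

By construction, `hill_rate` returns half of Vmax when the substrate concentration equals S0.5, whatever the value of nH.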

In general, prokaryotic NDP-sugar PPases are relatively specific for their substrates, although some 
exceptions have been reported. For example, dTTP is a substrate in some bacterial UDP-Glc 
PPases [8,11,32], whereas GDP-mannose (GDP-Man) PPases from M. tuberculosis and from 
Leptospira interrogans exhibit promiscuity in the use of the nucleotide triphosphate (NTP) 
substrate [33,34]. Different NTPs (ATP, UTP, ITP, GTP, CTP, dTTP) at 0.1, 0.5 and 2.5 mM final 
concentration were assayed as alternative substrates for SmuGalU and SmuGalU-Δ294EcoGlgC. It was 
observed that S. mutans UDP-Glc PPase was active with dTTP as an alternative to UTP. The activity 
in the direction of dTDP-Glc synthesis was four-fold lower than the production of UDP-Glc, although 
the enzyme showed similar affinity for either NTP (S0.5 0.54 mM, at 1 mM Glc-1P and 3 mM Mg²⁺). 
Results with SmuGalU are in accordance with previous reports indicating that these enzymes 
(from a prokaryotic source) are capable of utilizing dTTP [8,11,32]. Conversely, chimeric 
SmuGalU-Δ294EcoGlgC exhibited a high specificity toward UTP. 

2.3. The Chimeric SmuGalU-Δ294EcoGlgC Protein Exhibits Allosteric Properties 

Activation-inhibition assays were performed for both recombinant proteins under study (SmuGalU 
and SmuGalU-Δ294EcoGlgC), testing several compounds known to be important effectors of 
ADP-Glc PPases from different sources [1,2]. The metabolites utilized were phosphoenolpyruvate 
(PEP), fructose-1,6-bisphosphate (Fru-1,6-bisP), pyruvate (Pyr), 3-phosphoglyceric acid (3-PGA), 
Glc-6P, ribose-5P (Rib-5P), fructose-6P (Fru-6P), Man-1P, Man-6P, Pi, AMP, ADP, NAD⁺, NADH, 
NADP⁺ and NADPH. These effectors were tested at up to 10 mM (Pyr was even varied up to 150 mM) 
under activity assay conditions that were saturating and non-saturating in respect to the amount of 
substrates in the medium. None of the metabolites analyzed affected the activity of SmuGalU. This 
insensitivity to regulation is in good agreement with data reported for bacterial UDP-Glc PPases, 
which in general are not regulated by allosteric effectors [5]. On the other hand, Pyr, Fru-1,6-bisP and 
3-PGA activated chimeric SmuGalU-Δ294EcoGlgC (Figure 5) (none of the molecules analyzed 
inhibited the enzyme). This fact per se is one of the striking results of this work, since it constitutes the 
first report of a protein having UDP-Glc PPase activity (or a PPase activity other than 
ADP-Glc PPase) and being subject to allosteric regulation. 

Figure 5. Saturation curves for chimeric SmuGalU-Δ294EcoGlgC effectors: (a) 3-PGA, 
(b) Fru-1,6-bisP and (c) Pyr. Filled circles correspond to SmuGalU and empty circles to the 
chimeric SmuGalU-Δ294EcoGlgC enzyme. Reactions were conducted at 1 mM Glc-1P, 
1 mM UTP and 3 mM Mg²⁺. Values of relative activity were calculated based on activities 
measured in the absence of effector, specifically 40 U/mg and 11 U/mg for SmuGalU and 
SmuGalU-Δ294EcoGlgC, respectively. 




Saturation curves for each metabolite activating SmuGalU-Δ294EcoGlgC are detailed in Figure 5. 
Both 3-PGA and Pyr increased the Vmax of the hybrid fused protein, by 2.4- and 2.6-fold, respectively, 
while Fru-1,6-bisP activation was 1.6-fold. With respect to the apparent capacity for binding the 
allosteric effector, the chimeric enzyme showed the highest apparent affinity for Fru-1,6-bisP 
(A0.5 2.0 mM), with positive cooperativity (nH 1.5). For 3-PGA the behavior was hyperbolic 
(nH 1.0) with an A0.5 value of 5.4 mM, whereas Pyr gave a sigmoidal (nH 1.5) saturation curve from 
which an A0.5 of 30 mM could be calculated (Figure 5). It is worth noting that this apparent affinity 
value of the chimeric protein calculated for Pyr is in the same order as what was found for the 
interaction (also activating) of this metabolite with the E. coli ADP-Glc PPase [19]. 3-PGA and 
Fru-1,6-bisP are also the main activators of plant and some bacterial ADP-Glc PPases [1,35], although 
with higher affinities. 
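
The activation curves can be summarized with a Hill-type expression for the relative activity, v/v0 = 1 + (F - 1)·A^nH/(A0.5^nH + A^nH), where F is the maximal fold-activation. This functional form is an assumption chosen for illustration and is not necessarily the equation used to fit the data; the parameter values are those quoted in the preceding paragraph, and the names are hypothetical.

```python
def relative_activity(a_mm, fold_max, a05_mm, n_h):
    """Activity relative to the no-effector control at activator concentration a_mm (mM)."""
    saturation = a_mm**n_h / (a05_mm**n_h + a_mm**n_h)
    return 1.0 + (fold_max - 1.0) * saturation

# Maximal fold-activation, A0.5 (mM) and nH reported for the chimeric enzyme.
ACTIVATORS = {
    "3-PGA": {"fold_max": 2.4, "a05_mm": 5.4, "n_h": 1.0},
    "Fru-1,6-bisP": {"fold_max": 1.6, "a05_mm": 2.0, "n_h": 1.5},
    "Pyr": {"fold_max": 2.6, "a05_mm": 30.0, "n_h": 1.5},
}

for name, params in ACTIVATORS.items():
    # At A = A0.5 the enzyme shows half of its maximal activation.
    half = relative_activity(params["a05_mm"], **params)
    print(f"{name}: relative activity at A0.5 = {half:.2f}")
```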

It was valuable to analyze the kinetic parameters of SmuGalU-Δ294EcoGlgC toward the substrates 
and the divalent ion cofactor when assayed in the presence of each allosteric activating compound 
(Figure 6). Thus, saturation curves for UTP, Glc-1P and Mg²⁺ were obtained in the presence of either 
7.5 mM 3-PGA, 50 mM Pyr or 10 mM Fru-1,6-bisP. Figure 6a shows how the relative apparent 
affinity for each substrate/cofactor was affected by the respective allosteric activator. In general, the 
allosteric effectors decreased the S0.5 values (increasing the apparent affinity) for substrates of the 
chimeric enzyme. Notably, Pyr and Fru-1,6-bisP doubled the apparent affinity of the hybrid protein for 
UTP (the latter with an increase in the sigmoidal behavior). Pyr and 3-PGA also doubled the relative 
affinity for Glc-1P. On the other hand, the affinity of the chimeric enzyme for Mg²⁺ was not affected 
by 3-PGA, but it was increased two- to three-fold by Pyr or Fru-1,6-bisP (Figure 6a). Also, the Vmax 
of the chimeric enzyme in the absence of an allosteric effector (15.4 U/mg, see Table 1) was enhanced 
to 24.6, 37.0 or 40.1 U/mg by Fru-1,6-bisP, Pyr or 3-PGA, respectively. The combined 
increase in affinity and in Vmax exerted by the allosteric effectors on the chimeric enzyme means 
that they enhance the ratio Vmax/S0.5, which measures catalytic efficiency (Figure 6b), between 
two- and five-fold. These results highlight the sensitivity to allosteric effectors acquired by the 
chimeric SmuGalU-Δ294EcoGlgC. 

Figure 6. Modification by allosteric activators of SmuGalU-Δ294EcoGlgC (a) substrate 
apparent affinities; (b) catalytic efficiencies (Vmax/S0.5). The enzyme was analyzed with no 
effector (white bars), 7.5 mM 3-PGA (horizontal line bar), 50 mM Pyr (oblique line bar) or 
10 mM Fru-1,6-bisP (vertical line bar). Relative affinity in (a) measures the ratio of 
the S0.5 value determined in the absence of effector to that determined in the presence of the stated amount 
of effector. 




On the basis of the amino acid sequence alignment shown in Figure S2, we constructed a homology 
model for the chimeric SmuGalU-Δ294EcoGlgC protein (Figure 7). The structure was modeled using 
four simultaneous templates. Two templates representing the known atomic coordinates of prokaryotic 
(C. glutamicum and H. pylori) UDP-Glc PPases modeled the SmuGalU domain at the N-terminus of 
the chimeric protein. The C. glutamicum UDP-Glc PPase structure in particular allowed the location of 
the product UDP-Glc in the model. In turn, the architecture of the C-terminal domain of the hybrid 
protein (Δ294EcoGlgC) was fitted from the templates corresponding to known structures determined 
for two ADP-Glc PPases (the enzyme from A. tumefaciens and the small subunit from the potato tuber 
enzyme). The A. tumefaciens structure was particularly suitable for this, since it is a prokaryotic enzyme like 
E. coli GlgC, whereas the potato tuber structure was useful to locate the allosteric ligand in the 
chimeric protein. Interestingly, the model depicted in Figure 7 is consistent with the biochemical 
properties determined for the chimeric protein. Thus, the predicted structure shows a correct fold 
for a UDP-Glc PPase catalytic domain (including a UDP-Glc binding site) as well as a functional 
spatial distribution of the C-terminal allosteric domain, resembling its position in ADP-Glc PPases 
(Figure 7). When the C-domain is added to the N-terminal domain, it creates a pocket that could serve 
to accommodate small molecule regulators. This is illustrated by the modeling of two 
sulfates that in the potato tuber enzyme are located in the inhibitory phosphate site, interacting with 
residues in the C-terminus that are responsible for activation [18]. 

Figure 7. Modeling of the chimeric enzyme. The model was obtained as indicated in the 
Experimental Section. The UDP-Glc PPase domain from S. mutans is depicted in cyan, 
whereas the C-domain from E. coli ADP-Glc PPase is depicted in pink. Two sulfates 
modeled from the structure of the potato tuber ADP-Glc PPase are shown, together with 
the product UDP-Glc. 




3. Experimental Section 

3.1. Chemicals 

Antibiotics, isopropyl-β-thiogalactoside (IPTG), oligonucleotides, UTP, Glc-1P, 3-PGA, 
Fru-1,6-bisP and Pyr were obtained from Sigma-Aldrich (St. Louis, MO, USA). All other chemicals 
were of the highest quality available. 

3.2. Bacteria and Plasmids 

E. coli Top 10 F' cells and pGEM®T Easy vector were used for cloning purposes. Expression of 
galU and qgalU was performed using pRSET-B vector (Invitrogen, Carlsbad, CA, USA) and E. coli 
BL21 (DE3) as host. In addition, galU was also expressed using pET24 vector (Novagen, Madison, 
WI, USA). DNA manipulations and E. coli cultures as well as transformations were performed 
according to standard protocols [36]. 

3.3. Amplification of galU Gene from S. mutans and Construction of Chimeric galU 

The 921 bp gene coding for UDP-Glc PPase, SmugalU, was amplified using S. mutans ATCC 25175 
genomic DNA as a template and the gene-specific primers Smu1/Smu2, designed according to the 
information available in the GenBank database [37] for the gene coding for UDP-Glc PPase in S. mutans UA159 
(Gene ID: 1029376). All oligonucleotides used are detailed in Table 2. The 1350 bp gene coding for 
the chimeric protein was obtained by overlap extension PCR [38]. In a first step, two independent PCR 
reactions were conducted, using the primer pair Smu1/Qmr2 (Reaction 1) and the pair Qmr1/Qmr3 
(Reaction 2). Both Qmr1 and Qmr2 primers were designed to hybridize with the SmugalU and 
EcoglgC regions, respectively, starting at the codon for Pro295, with extra nucleotides to allow fragment 
fusion. Reaction 1 was conducted using a plasmid harboring SmugalU as a template, while Reaction 2 used 
the plasmid containing the glgC gene from E. coli as a template. Products from Reaction 1 and Reaction 2 
were purified and then used together as templates in a final PCR step, using Smu1/Qmr3 primers 
to obtain the complete chimeric gene. 
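
The overlap that allows the two first-round products to anneal in the final PCR step can be checked directly from the primer sequences listed in Table 2. The Python sketch below is an illustrative check rather than part of the protocol, and the helper names are hypothetical. It looks for the longest stretch shared by the reverse complement of Qmr2 (the 3' end of the Reaction 1 product) and Qmr1 (the 5' end of the Reaction 2 product).

```python
# Translation table for complementing a DNA sequence.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def longest_overlap(left: str, right: str) -> str:
    """Longest suffix of `left` that is also a prefix of `right`."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:k]):
            return right[:k]
    return ""

# Fusion primers as listed in Table 2 (5'->3').
QMR1 = "CTCGGACCCGGAACTGGATATGTACGATC"
QMR2 = "GTTCCGGGTCCGAGTTCTTTTGAGTCG"

# Top-strand sequence at the junction end of the Reaction 1 product.
reaction1_end = reverse_complement(QMR2)
overlap = longest_overlap(reaction1_end, QMR1)

print(f"shared junction sequence: {overlap} ({len(overlap)} nt)")
```

The shared sequence is the region through which the galU and glgC fragments prime each other before amplification with the outer primers Smu1 and Qmr3.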



Table 2. Oligonucleotide primers employed to amplify the SmugalU gene and to construct the 
gene coding for the SmuGalU-Δ294EcoGlgC chimeric protein. Restriction sites are underlined. 

Primer   Sequence                                 Restriction site
Smu1     5'-GGATCCCATGCCAAGTAAAAAAGTCAG-3'        BamHI
Smu2     5'-GAATTCCTTAATCCGAGTTCTTTTGAG-3'        EcoRI
Qmr1     5'-CTCGGACCCGGAACTGGATATGTACGATC-3'
Qmr2     5'-GTTCCGGGTCCGAGTTCTTTTGAGTCG-3'
Qmr3     5'-CTCGAGTTATCGCTCCTGTTTATGCCC-3'        XhoI

All PCR reaction mixtures (50 µL) contained 100 ng of genomic DNA, 2 pg of each primer, 
0.2 mM of each dNTP, 1.5 mM Mg²⁺ and 1 U Taq or Pfu DNA polymerase (Fermentas, St. Leon-Rot, 
Germany). Standard PCR conditions were used for 30 cycles: denaturation at 94 °C for 1 min, 
annealing at 55 °C for 1 min and extension at 72 °C for 2 or 4 min (depending on whether Taq or Pfu was used, 
respectively), with a final extension of 10 min at 72 °C. PCR products were resolved by electrophoresis 
in a 1% (w/v) agarose gel and purified with the Wizard SV Gel & PCR Clean-Up System 
(Promega, Fitchburg, WI, USA), according to the manufacturer's instructions. 

3.4. Cloning of SmugalU and Chimeric SmugalU-Δ294EcoglgC Genes 

Amplified genes were cloned into the T-tailed plasmid pGEM-T Easy and identities were confirmed 
by DNA sequencing. The SmugalU gene was sub-cloned into the pRSET-B BamHI/EcoRI sites to obtain 
a protein His-tagged at the N-terminus. The chimeric gene SmugalU-Δ294EcoglgC was inserted between 
the BamHI/XhoI sites of pRSET-B. Thus, both SmuGalU and the SmuGalU-Δ294EcoGlgC enzyme could be 
obtained with an N-terminal His-tag (see Figure 1c). 

3.5. Enzyme Expression and Purification 

The plasmid harboring SmugalU was used to transform E. coli BL21 (DE3) competent cells. 
Transformed cells were grown in YT2X medium at 37 °C, 200 rpm, until the A600 nm reached 0.6-0.8 
and were induced with 0.8 mM IPTG at 25 °C overnight. In contrast, soluble expression of chimeric 
SmuGalU-Δ294EcoGlgC was achieved using E. coli BL21 (DE3) pLysS as a host strain and inducing 
with 0.4 mM IPTG at 25 °C overnight in LB medium. 

For expression, 1 L cultures were grown for each protein under the conditions detailed above. 
Cells were harvested by centrifugation at 5000 rpm for 10 min at 4 °C and resuspended in 5 mL of 
buffer H (20 mM Tris-HCl pH 8.0, 500 mM NaCl, 10 mM imidazole) per g of cells. Supernatants were 
obtained after cell disruption by sonication on ice (eight times for 30 s with 60 s intervals) and 
centrifugation at 16,000 rpm for 20 min at 4 °C. 

SmuGalU and chimeric SmuGalU-Δ294EcoGlgC were expressed as N-terminal His-tag fusions in 
order to facilitate their subsequent purification. Enzymes were purified by affinity chromatography, 
using Ni-NTA Agarose resin (Invitrogen, Carlsbad, CA, USA) according to the protocol supplied by 
the manufacturer. Briefly, crude extract fractions prepared in buffer H were loaded onto previously 
equilibrated columns. After extensive washing with buffer H, samples were eluted by means of a 
linear gradient to buffer I (20 mM Tris-HCl pH 8.0, 500 mM NaCl, 300 mM imidazole). Elution 
fractions containing the corresponding recombinant enzyme were analyzed by SDS-PAGE [39] to 
check for purity. For each recombinant enzyme, active fractions eluted from the Ni-NTA column were 
pooled, dialyzed to remove imidazole and supplemented with 10% (v/v) glycerol. Both recombinant 
enzymes were stable for at least six months when stored at -80 °C under the above specified 
respective conditions. 

3.6. Protein Methods 

Protein concentration was determined by the modified Bradford assay [40] using bovine serum 
albumin as a standard. Recombinant proteins and purification fractions were analyzed electrophoretically 
in sodium dodecyl sulphate polyacrylamide gels (SDS-PAGE) according to [39]. Gels were stained 
with Coomassie Brilliant Blue. 

3.7. Determination of Optimal pH for Activity 

Bis-Tris-propane, 2,2'-(propane-1,3-diyldiimino)bis[2-(hydroxymethyl)propane-1,3-diol] (Sigma, 
St. Louis, MO, USA), which has a wide buffering range (from pH 6.0 to pH 10.0), and tricine, 
N-(2-hydroxy-1,1-bis(hydroxymethyl)ethyl)glycine (Sigma, St. Louis, MO, USA; from pH 7.5 to pH 9.0), 
were used to determine the optimal pH for SmuGalU and chimeric SmuGalU-Δ294EcoGlgC activities. 
Measurements were conducted in the direction of UDP-Glc synthesis. 

3.8. Molecular Mass Determination 

The molecular masses of SmuGalU and the chimeric SmuGalU-Δ294EcoGlgC enzyme were 
determined by gel filtration using a Tricorn 5/200 column (GE Healthcare). A Gel Filtration 
Calibration Kit-High Molecular Weight (GE Healthcare) with protein standards including 
thyroglobulin (669 kDa), ferritin (440 kDa), aldolase (158 kDa), conalbumin (75 kDa) and ovalbumin 
(44 kDa) was used. The column void volume was determined using a dextran blue loading 
solution (Promega, Fitchburg, WI, USA). 
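A calibration of this kind is commonly evaluated by plotting the partition coefficient Kav = (Ve - V0)/(Vc - V0) of each standard against the logarithm of its molecular mass and interpolating the sample. The following Python sketch illustrates that calculation under stated assumptions: it is not the procedure reported here, the function names are hypothetical, and the elution volumes (Ve), void volume (V0) and total column volume (Vc) must be supplied by the user. Only the molecular masses of the standards are taken from the text.

```python
import math

# Molecular masses (kDa) of the calibration standards listed in the text.
STANDARDS_KDA = {
    "thyroglobulin": 669.0,
    "ferritin": 440.0,
    "aldolase": 158.0,
    "conalbumin": 75.0,
    "ovalbumin": 44.0,
}


def k_av(elution_ml, void_ml, column_ml):
    """Partition coefficient Kav = (Ve - V0) / (Vc - V0)."""
    return (elution_ml - void_ml) / (column_ml - void_ml)


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def calibrate(elution_ml_by_standard, void_ml, column_ml):
    """Fit Kav versus log10(mass in kDa) for the measured standards."""
    names = list(elution_ml_by_standard)
    xs = [math.log10(STANDARDS_KDA[name]) for name in names]
    ys = [k_av(elution_ml_by_standard[name], void_ml, column_ml) for name in names]
    return fit_line(xs, ys)


def estimate_mass_kda(elution_ml, void_ml, column_ml, slope, intercept):
    """Invert the calibration line to estimate a sample's mass in kDa."""
    kav = k_av(elution_ml, void_ml, column_ml)
    return 10 ** ((kav - intercept) / slope)
```

Given a dictionary of measured elution volumes keyed by standard name, `calibrate` returns the slope and intercept of the calibration line, and `estimate_mass_kda` then converts the elution volume of an unknown into an apparent molecular mass.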

3.9. Enzyme Assays 

Synthesis of UDP-Glc was assayed by following the formation of Pi (after hydrolysis of PPi by 
inorganic pyrophosphatase) with the highly sensitive colorimetric method previously described [41]. The 
standard reaction mixture contained 100 mM MOPS (pH 8.0), 3 mM MgCl2, 1.5 mM UTP, 0.2 mg/mL 
bovine serum albumin, 0.5 mU/µL yeast inorganic pyrophosphatase and a proper enzyme dilution. 
Assays were initiated by the addition of Glc-1P in a total volume of 50 µL. Reaction mixtures were 
incubated for 10 min at 37 °C and terminated by adding the Malachite Green reagent [41]. The 
complex formed with the released Pi was measured at 630 nm with an ELISA EMax detector 
(Molecular Devices), using sodium PPi as the standard. 

One unit (U) of enzyme activity is equal to 1 µmol of product formed per minute under the 
respective assay conditions specified above. 
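The unit definition translates into a simple conversion from the amount of product measured in an assay to specific activity. The Python sketch below is illustrative only: the function name is hypothetical, and it assumes the amount of product (in nmol) has already been read from the standard curve.

```python
def specific_activity_u_per_mg(product_nmol, minutes, enzyme_mg):
    """Specific activity in U/mg, with 1 U = 1 µmol of product per minute."""
    if minutes <= 0 or enzyme_mg <= 0:
        raise ValueError("incubation time and enzyme amount must be positive")
    units = (product_nmol / 1000.0) / minutes  # nmol -> µmol, then per minute
    return units / enzyme_mg
```

For a 10 min incubation such as the standard assay above, `minutes` would be 10 and `enzyme_mg` the mass of enzyme present in the 50 µL reaction.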

3.10. Calculation of Kinetic Constants 

Kinetic assays were performed using specified concentrations and conditions for all reaction 
mixture components. Saturation curves were obtained by assaying the respective enzyme activity at 
a saturating level of a fixed substrate and different concentrations of the variable substrate. The 
experimental data were plotted as enzyme activity (U/mg) versus substrate (or effector) concentration 
(mM), and kinetic constants were determined by fitting the data to the Hill equation as described 
elsewhere [20]. Fitting was performed with the Levenberg-Marquardt nonlinear least-squares 
algorithm provided by the computer program Origin™. Hill plots were used to calculate the Hill 
coefficient (nH), the maximal velocity (Vmax), and the kinetic constants that correspond to the activator, 
substrate or inhibitor concentrations giving 50% of the maximal activation (A0.5), velocity (S0.5) or 
inhibition (I0.5). All kinetic constants are the mean of at least three sets of data, which were 
reproducible within ±10%. 
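For a substrate saturation curve, the Hill equation takes the form v = Vmax · S^nH / (S0.5^nH + S^nH). The Python sketch below illustrates how nH and S0.5 can be read from a Hill plot. It does not reproduce the Levenberg-Marquardt fit performed in Origin: it uses the simpler linearization log(v/(Vmax - v)) = nH·log S - nH·log S0.5 and assumes that Vmax is already known. The function names are hypothetical.

```python
import math


def hill_velocity(s, v_max, s_half, n_h):
    """Hill equation: v = Vmax * S^nH / (S0.5^nH + S^nH)."""
    return v_max * s ** n_h / (s_half ** n_h + s ** n_h)


def hill_plot_fit(concentrations, velocities, v_max):
    """Estimate (nH, S0.5) from a Hill plot by linear least squares.

    Fits log10(v / (Vmax - v)) = nH * log10(S) - nH * log10(S0.5).
    Points with S <= 0, v <= 0 or v >= Vmax are skipped because the
    logarithms are undefined there.
    """
    xs, ys = [], []
    for s, v in zip(concentrations, velocities):
        if s > 0 and 0 < v < v_max:
            xs.append(math.log10(s))
            ys.append(math.log10(v / (v_max - v)))
    if len(xs) < 2:
        raise ValueError("need at least two usable data points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("substrate concentrations must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    n_h = sxy / sxx                      # slope of the Hill plot
    intercept = mean_y - n_h * mean_x    # equals -nH * log10(S0.5)
    s_half = 10 ** (-intercept / n_h)
    return n_h, s_half
```

The linearized estimate is sensitive to the value assumed for Vmax and to noise near saturation, which is one reason a direct nonlinear fit of the untransformed data is generally preferred for the final constants.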

3.11. Homology Modeling 

Modeling of the chimeric enzyme was performed using the program Modeller 9v1 [42]. For that 
purpose, the structure was modeled using four simultaneous templates: the known atomic coordinates 
of the UDP-Glc PPase from C. glutamicum in complex with magnesium and UDP-Glc (chain A, PDB 
code 2pa4), the apo structure of the UDP-Glc PPase from H. pylori (chain A, PDB code 3juj), the 
structure of the ADP-Glc PPase from A. tumefaciens (PDB code 3brk), and the structure of potato 
tuber ADP-Glc PPase in complex with sulfates in the putative regulatory site. In this way, the coordinates of 
the UDP-Glc PPases provide the structural information to build the S. mutans domain of the chimeric 
enzyme (N-terminus), and those of the ADP-Glc PPases provide the information needed to build the E. coli 
domain of the chimera (C-terminus). The A. tumefaciens structure is particularly suitable for this, since it is 
from a bacterial enzyme like that of E. coli, whereas the potato tuber structure provides information about 
where the sulfate ligands are placed. Similarly, the C. glutamicum structure was used to place the product 
UDP-Glc in the model. All the templates were manually structurally aligned to each other before the sequence 
alignment with the chimeric target was performed. The reliability of the model was evaluated using the 
program Verify3D [43]. The initial model validated well with Verify3D, so further iterations of the 
alignment were not necessary. 

4. Conclusions 

Herein, we present results that strongly support that fusion of the C-terminal domain of E. coli 
ADP-Glc PPase to the entire UDP-Glc PPase from S. mutans produces a chimeric protein with 
UDP-Glc PPase activity that is sensitive to allosteric activation by specific metabolites. To the best of our 
knowledge, this is the first report of allostery by binding of small molecules to a UDP-Glc PPase (and, 
more broadly, to an NDP-Glc PPase other than ADP-Glc PPase) of prokaryotic or eukaryotic origin [5]. 
Previous work has demonstrated that UDP-Glc PPases are poorly regulated or not regulated at all, with the 
exceptions of: (i) the barley enzyme, whose activity is affected by the occurrence of different quaternary 
arrangements, being active as a monomer [44-46]; and (ii) the E. histolytica enzyme, which responds to 
modification of critical cysteine residues by oxidants and reducing agents [7]. 

ADP-Glc PPases are allosteric enzymes catalyzing the key step in glycogen and starch synthesis in 
bacteria and plants, respectively [1,2]. Many studies have been conducted in order to identify the 
amino acid residues involved in the allosteric response. Thus, it has been proposed that Lys39 is 
important in the interaction of Fru-1,6-bisP with E. coli ADP-Glc PPase [47-49]. In addition, an Arg 
residue from A. tumefaciens ADP-Glc PPase (similar to Lys39 in the E. coli enzyme), Arg33, and Arg45 are 
involved in allosteric effector binding [21]. These residues belong to the 
N-terminal domain of the bacterial ADP-Glc PPase, and a recent study has clearly established the 
importance of the E. coli ADP-Glc PPase N-terminal domain in the activation of the enzyme [22]. In 
addition, removal of the first amino acid residues of the N-terminal domain 
rendered the enzyme fully active in the absence of the allosteric activator [50,51]. However, results obtained 
with chimeric enzymes constructed by switching the N- and C-terminal domains of the E. coli and 
A. tumefaciens ADP-Glc PPases indicated that the C-terminal region was critical in determining the 
affinity and specificity for effectors [19,22]. This also suggests that in prokaryotic ADP-Glc PPases 
the N- and C-terminal domains are interconnected in the response of the enzyme to the allosteric 
effectors. In this context, our results with the chimeric enzyme SmuGalU-Δ294EcoGlgC support this 
model, since we demonstrated that the presence of a single C-terminal domain confers 
sensitivity to allosteric activation on a previously non-allosteric UDP-Glc PPase. 

It should be noted that the activation pattern exhibited by the chimeric SmuGalU-Δ294EcoGlgC is a 
novel feature acquired by the S. mutans UDP-Glc PPase after being transformed (by domain fusion) into 
the hybrid enzyme. This strongly supports the functional allosteric regulatory role of the 
C-terminal domain found in ADP-Glc PPase. The results also support the view that such a domain can be 
modularly fused to add allosteric properties to different PPases. It is tempting to speculate that 
evolution followed a similar strategy to modify one enzyme of this type (with specificity toward 
ADP-Glc), elongating the protein to trigger allosteric regulation in a general way. Later, critical 
changes in the C-terminal regulatory domain would have conferred specificity for different allosteric 
effectors within ADP-Glc PPases. We do not know the mechanism by which the N-terminal domain is activated in 
the presence of the C-terminal domain and the proper activator, but our experiments indicate that the 
catalytic domain of NDP-Glc PPases may have an intrinsic capacity for its activity to be modulated by 
certain interactions, even when the enzyme is not natively regulated. In this case, the presence of 
a foreign domain triggers the allosteric properties. 